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(g) Vascular endothelial cell growth factor C subunit 

@ Vascular endothelial cell growth factor C sub- 
unit DNA is prepared by polymerase chain reac- 
tion techniques. The DNA encodes a protein 
that may exist as either a heterodimer or 
homodimer. The protein is a mammalian vascu- 
lar endothelial cell mitogen and as such is 
useful for the promotion of vascular develop- 
ment and repair. This unique growth factor is 
also useful in the promotion of tissue repair. 
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BRIEF DESCRIPTION OF THE DRAWING 

A * F W: F ?" Ie " 9th aminoacid residue P rotein translation product and its cDNAcoding sequence for VEGF 
AA subunrt A plus polypeptide cleavage products used to determine the amino acid sequence 
a» TVl F f aminoacid residue P rotein translation product and its cDNAcoding sequence for VEGF 
AB subunrt A plus polypeptide cleavage products used to determine the amino acid sequence 

AB subunrt B plus polypeptide cleavage products used to determine the amino acid sequence 

^gpTT« . CDN J ^ 8 T enC8 a " d *" ' enflth amin ° 8cid residue P ratein Elation Product for 
VEGF A 146 amino acid residue subunit SEQ ID NOS:23 & 33. 

VFPP^n a FU " CDN J e0d ! nQ SeqUe0Ce a " d M ' ength acid residue > Jrotein ^nslation product for 
VEGF A 190 amino acid residue subunit SEQ ID NOS:30 & 31. 

vrrPA 0 ^!' FU " ° DN ? °° din9 8eqUenCe 8nd M ,e " 9th 8min0 acid residue P rotein ^"slation product for 
VEGF A 214 amino acid residue subunit SEQ ID NOS:34 & 35. 

v™"?,!' FU " ° DN ^ COdi " 9 8eqUenCe 3nd M ,en9th amino add residue P rotein translation product for 
VEGF B 1 38 amino acid residue subunit SEQ ID NOS:36 & 37. 

xinrct?.!' FU " CDNA °° din9 S6qUenCe 3nd M le " 9th amino acid residue P™ 1 *" translation product for 
VEGF B 1 58 amino acid residue subunit SEQ ID NOS:38 & 39. 

vc/c^?*?' FU " CDNA ° 0din9 SeqUenC8 aPd fU " ,ength amino acid residue P rotein translation product for 
VEGF C 1 54 amino acid residue subunit SEQ ID NOS:40 & 41. 

BACKGROUND OF THE INVENTION 

A new class of cell-derived dimeric mitogens with apparently restricted specificity for vascular endothelial 
ce Is has recently been identified and generally designated vascular endothelial growth factors (VEGFs) The 
mitogen has been purified from: conditioned growth media of rat glioma cells, [Conn et al.. Proc. Nad Acad 
Sc.. USA §7: 1323-1327 (1990)]: conditioned growth media of bovine pituitary folliculo stellate cells Ferrara 
T n«1 Z fi J^ 8 ,^ Phy8, ReS - C ° mm - m 851 " 858 (1989 > and Gospodarowicz et al.. Proc. Natl. Acad 
2^ I" I ( mi *" end0thelial ce " 9 rowth fac tor isolated form mouse myoblastoma cell line 
NB41 w.th an unreduced molecular mass of 43-51 kDa and a reduced mass of 23-29 kDa has been described 
by Levy et al.. Growth Factors 2: 9-19 (1989). Connolly et al. (J. Biol. Chem. 264: 20017-20024 [19891- J Clin 

Tn T I 4 ? 1478 11 9891) d68Cribe 3 human vascular Permeability factor^ stimulates vascular endoJ 
hal cells tod.v,de in yjtro and promotes the growth of new blood vessels when administered into healing rabbit 

iTI™ Z T ^T, 38 - *" endothelial ce " 9 rowth factor •» been purified from the conditioned medium of 
meAtT^Oprtuitary cell line by Plouetetal.. EMBO Jownal 8: 3801-3806(1989). The growth factor was charac- 
^no 8 ?^ ^mercomposedof subuntewith molecularmassof23kDa. Leungetal. (Science246: 1306- 

2628-2632 [1990]) have described cDNAs which encode VEGF A which is homologous to the A and B chain's 
of platelet-denved growth factor. Vascular endothelial growth factor I (VEGF I. VEGF AA) is a homodimer with 
an apparent molecular mass of 46 kDa. with each subunit having an apparent molecular mass of 23 kDa VEGF 
I has distinct structural similarities to platelet-derived growth factor (PDGF), a mitogen for connective tissue 
cells but not vascular endothelial cells from large vessels. 

OBJECTS OF THE INVENTION 

i. " .f '^?f 3 I din9ly ' a " ° bjeCt * the present inventi °n to provide novel vascular endothelial growth factor C 
subunit DNA free of other mammalian DNA. Another object is to provide recombinant genes capable of exp- 
ressing VEGF C subunit monomer or dimer. Another object is to provide vectors containing the DNA sequences 
£ f nu a C T / S " bunits - A further is to P™ d e a host cell transformed with a vector containing 

to > DNA sequence for VEGF A or B plus C or VEGF C alone. It is also an object to provide a recombinant pro- 
cess for making VEGF C subunit. Another object is to provide a novel vascular endothelial cell growth factor 
which contains the C subunit. This may include heterodimers AC and BC and homodimer CC. 

SUMMARY OF THE INVENTION 

Va rl CU l a [,! nd0thelial Ce " 9TOWth fact0r C subunit DNA is P fe P ared bv Polymerase chain reaction techni- 
ques. The DNA encodes a protein that may exist as either a heterodimer or homodimer. The protein is a mam- 
malian vascular endothelial cell mitogen and as such is useful for the promotion of vascular development and 
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repair. This unique growth factor is also useful in the promotion of tissue repair. 
DETAILED DESCRIPTION 

The present invention relates to a unique vascular endothelial cell growth factor (designated VEGFV .so- 
lated and purified from glioma cell conditioned medium, which exhibits mitogenic stimulate of vascular en- 
dothelial cells. Glioma is defined herein as any neoplasm derived from one of the various types of cells tna. 
form the interstitial tissue of the central nervous system including brain, sp,nal cord, posterior pituitary gianc 
and retina Consequently, the scope of the present invention is intended to include tne unique growtn .actor 
isolated and purified from any mammalian tissue or other cells including cel. lines. 
not limited to. glioma-derived cell lines such as C6. hs 683 and GS-9L glioblastomas such as A-172 and T98G. 
neuroblastomas such as IMR-32 and SK-N-MC; neurogliomas such as H4; tetromas such as XB-2; 
astrocytomas such as U-87 MG and U-373 MG; embryonal carcinomas and non-transformed glial oi ^astrocyte 
S lTnes and the human myoblastoma line TE 671, with GS-9L and TE 67! bemg preferred. ,s 
present and can be isolated from rat tissue including ovary, heart and kidney. Anterior pituitary tumor cell ..nes 
such as GH3 and Hs 199 may also be used. It is intended that VEGF of this invention can be obtained from 
any mammal species capable of producing VEGF, this includes, but is not limited to. rat and numan. 

Vascular endothelial cell growth factor may exist in various microheterogeneous forms which are isolated 
from one or more of the various cells or tissues described above. Microheterogeneous forms as used herein 
refer to a single gene product, that is a peptide produced from a single gene unit of DNA, which is structurally 
modified at the mRNA level or following translation. Peptide and protein are used interchangeably herem. The 
microheterogeneous forms will all have similar mitogenic activities. Biological activity and b,olog,cally active 
Re used interchangeably and are herein defined as the ability of VEGF to stimulate DNA syndesis in targe 
cells including vascular endothelial cells as described below which results in cell proliferation. The modrf cafions 
may take place either in vivo, or dunng tne isolation and purification process. In vivo modification results from, 
but is not limited to, P ro.e~o^s,s. glycosy.ation. phosphorylation, deamidation or acetylation at the ■ 
Proteolysis may include exoproteolysis wherein one or more terminal amino acids are sequentially enzymat, 
cally cleaved to produce microheterogeneous forms which have fewer amino acids than the ong.nal gene pro- 
duct Proteolysis may also include endophytic modtfication that results from the action of endoproteases 
which cleave the peptide at specific locations within the amino acid sequence. Similar modifications can occur 
during the purffication process which also results in production of microheterogeneous forms. The most com- 
m0 n modification occurring during purification is proteolysis which is generally held to a minimum by the use 
of protease inhibitors. Under most conditions one or more microheterogeneous forms are present follow ng puri- 
fication of native VEGFs. Native VEGFs refers to VEGF isolated and purified from cells that produce VEGFs 
Vascular endothelial cell growth factor may also exist in various alternatively spliced forms which ,. defined 
herein as the production of related mRNAs by differential processing of exons and introns. Exons are defined 
as those parts of the DNA sequence of a eukaryotic gene that code for the final protein product. It is also in- 
tended that the present invention includes VEGF subunits A.B and C which are defined as comprising the fuN 
length translation products of all alternatively spliced mRNAs made from the gene encoding the subunits and 
their corresponding mature amino acid sequences generated by proteolytic removal of the amino terminal sec- 
retory leader amino acid sequences. It is further intended that the invention only include those micro- 
heterogeneous and alternatively spliced VEGF subunits which when in the dimeric form exhibit b.olog.cal 
activity such as vascular endothelial cell stimulation as discussed below. ,, 7K/ . m2 
Glioma cells such as the rat cell line GS-9L are grown to confluence in tissue culture flasks, about 175 cm 
in a cell culture medium such as Dulbecco's Mod,fied Eagle's Medium (DMEM) supplemented w,th about 10 4 
newborn calf serum (NCS). When the cells reach confluence the culture medium is removed, the cell layers 
are washed with Ca~, Mg"-free phosphate buffered saline (PBS) and are removed from the flasks by treatment 
w h a so utl of trypsin 9 about 0.1%,and EDTA. about 0.04%. The cells, about 1 * 10- are pellete by cen- 
Lgation. resuspended in about 15 00 ml of DMEM containing about 5% NCS and plated .nto a ten level ceH 
factory (NUNC). 6,000 cm> surface area. The cells are incubated for about 48 to about 96 hours, w.th 72 hours 
preferred, at about 37' C in an atmosphere of about 5% C0 2 . Following incubation the ^ ediljm ^ r ^f a d .^^ 
the cell factories are washed about 3 times with PBS. About 1500 ml of fresh cul^e med.a ,s adde con a ng 
about a 1:2 mixture of Ham's-F12/DMEM containing about 1 5 mM Hepes. pH about 7.4 aoout 5 ug/ml msuha 
about 1 0 ug/ml transferrin and with or without about 1 .0 mg/ml bovine serum albumin. Th,s medium i , « rep .laced 
with fresh medium after about 24 hr and collected every 48 hr thereafter. The collected conditioned medium . 
filtered through Whatmen#1 paper to remove cell debris and stored at about -20" C. 

The GS-9L conditioned medium is thawed and brought to P H 6.0 with 1 M HCI. The initial punfication step 
consists of cation exchange chromatography using a variety of cation exchangers on a variety of matrices such 
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Zn*JTr t*Z? 0, Phamiacia S. Zetachrom SP and Polyaspartic Acid WCX (Nest Group) with CM 

222^ r ™ bei "9 P referred - T*» VEGF<ontaining culture medium " ^ miSd wl cM 

th 3t a6 ° Ut 2 9m ab0Ut 20 L ° f "» a "< at low pZt aCt 24 

■ a h ^ iS a " 0Wed to Settle 3nd 11,9 8Xcess * "»«-• ™« re^in sZTpa^ o a 

• column and the remaining culture medium is removed. Unbound protein is washed from 17^. J « J 

M sodium phosphate, about P H 6.0. containing 0.15 M Nad. SSSS E£13S a^O o^Tod 
phosphate, about pH 6.0, containing about 0.6 M NaCI M SOd,um 

gluttr ^ ^ NaC ' buffWCOntai "^ ab °* 0 M a-meth* mannoside and about 5 28 M me^ 

The VEGF AB active eluate from the Con-A column is applied to a PolvasDartic Acid wry h 
high performance liquid chromatography (HPLC) column 4 6 mm x 25 Tn fTJLTr^, ? Ch3n9e 
sodium phosphate buffer. pH BjFniZLn Je^w.h aZear g^m a Z to M N^O inT 
£osphate buffer over about 60 minutes. The flow rate is maintains 7Z^S%^£^ 0 tVS 

S^sssr helial ceN 9rawth factor ab act ^ is present in e,r 9 r r;^: 

to aJout STo'^E^^ WCX C0 ' Umn th3t COntain VEGF AB are P 001 ^. adjusted 

£L« If ™ an ; i ,oa J ied ° nto a 1 x 10 cm column of Pharmacia Chelating Sepharose 6B charged with an 

Tm M o Tf <M " jilibrated in ab °* 005 M sodium phosphate, about pH 7 0 wnSo about 

2 M NaCI and about 0.5 mM imidazole (A buffer). VEGF AB is eluted from the column wftha grad^ n^^m 0 20V 

b S J72 m 2 £ 35% B h ove L 45 minutes and *- 1 00% B over 5 minutes * a 2i SS 3 t££ w e * 

B buffer ,s 0.05 M sod.um phosphate. pH 7.0, containing about 2 M NaCI and 100 mM imidazote The *ZZ 
T^TZ 9 ^ GF AB ^ ^ b6,Ween ab0Ut 12 ' 6 and 228 * - me £E£^££ 
4.6 mm xTcm v^dac " S C °' 1,ainin9 " ^ ,he "»« Che,ate « ^ 

acid e ™i P S! e ^ PLC ""I""] 5 ^ Partide Si26) PreVi0USly ^"i'frated in solvent A [0.1% trifluoroacetic 
fa T, S Th f 1 C R ° lumn IS eluted wth a ,inear Sradient of about 0 to 30% solvent B over 15 minutes 30% B 

vrM^t; in i^.b/o crosshnked gels using the technique of Laemmli Nature 227*ean fiftA mot™ tL i 

n»M Mh Hark, b.l.no.d s al, conalning .boo, 20 mM H„«. .boo. phTTm SoSm 

dciaic norooiast growth factor, does not enhance mitogenic stimulation of these cells by VEGF AB. 
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A purified about 1-2 ug sample of VEGF AB is reduced in about 0.1 M Tito, about pH 9.5 wrth about 0.1 /. 
EDTAabout6Mguanidinium chloride and about 20mMdith«^ 

protein is carboxymethylated for about 1 hour in a solution containing about 9.2 nM of unlabeled and 2.8 uM 
of uc-iodoawticacid in about 0.7 M Tris, about pH 7.8. and about 0.1% EDTA and about 6 M guamd.n.um 
chloride. The protein is carboxymethylated for about 1 hr at room temperature. The protein is isolated after re- 
duction and carboxymethylation by reverse phase HPLC chromatography on a Vydac ^column, 
mm x 5 cm. The protein subunits are loaded onto a column pre-equilibrated with about 0.1% TFA and eluted 
by a 45 ml linear gradient from about 0.1% TFA to 0.1% TFA/67% acetonitrile at a bow rate of about 0.75 ml/mm. 
The reduced and carboxymethylated protein eluted as two peaks at approximately 23 and 25 ml with the pro- 
portion being approximately equal as determined by monitoring absorbance at 210 nm. 

Samples of the reduced and carboxymethylated monomers are applied to polybrene-coated glass fiber fil- 
ters and their N-terminal sequences are determined by Edman degradation in an ABI gas phase microsequ- 
encer in conjunction with an ABI 120A on line phenylthiohydantoin analyzer following the manufacturers 
instructions. The protein showing the peak of absorbance eluting at approximately 25 ml (A subunrt or monomer) 
yielded an amino terminal sequence of: SEQ ID NO:1 

Ala Pro Thr Thr Glu Gly Glu Gin Lys Ala His Glu Val Val w,,™, 
which is identical to the A chain monomers of VEGF AA. Conn et al.. Proc. Natl. Acad. Sa. USA 87, »2M«32 
(1990). The peak of absorbance eluting at approximately 23 ml (B subunit or monomer) y, elded an N-term.nal 
sequence of: SEQ ID NO:2 

Ala Leu Ser Ala Gly Asn Xaa Ser Thr Ser Thr Glu Met Glu Val Val 

Pro Phe Asn Glu Val . . 
plus a nearly equal amount of a truncated form of the same sequence missing the first three ammo acid resh 
dues. The missing Xxx residue corresponds to an Asn residue in the cloned cDNA see Mow Sine, th.s mis- 
sing Asn occurs in a classical Asn Xxx Ser/Thr N-glycosylation sequence it is presumed to be glycosylated. 
The A subunit and the total of both B subunits are recovered in nearly equal amounts supporting the interpre- 
tation that the two peptides combine to form an AB heterodimer in VEGF AB. 

A sample of the A monomer was treated wth either the protease trypsin which cleaves polypeptides on 
the C-terminal side of lysine and arginine residues or Lys C which cleaves polypeptides on the germinal side 
of lysine by procedures well known in the art. The peptides are isolated by reversed phase - HPLC(RP-H PLC) 
The amino acid sequences of the isolated peptides are determined using the Edman degradation in the ABI 
gas phase sequenator in conjunction with the AB1 120 A on line phenylthiohydantoin analyzer following man- 
ufacturer's instructions. The amino acid sequences are shown in Figure 1. 

Reduced and carboxymethylated A monomer is dried and solubilized in about 0.7 M Tris, about pH 7.8. 
about 6 M guanidinium chloride containing about 0.1% EDTA V8 protease is added in 0. 1 M ammonium bicar- 
bonate buffer, about pH 8.0. and the mixture is incubated for about 48 hr at about 37°C. The protease cleaves 
predominantly on the carboxyl terminal side of glutamic acid residues. The resulting polypeptides were resolved 

by C,« RP-HPLC as above. .kcm 
The reduced and carboxymethylated A subunit protein solution is adjusted to a pH of about 6.8 1 with 6 N 
HCI and dithiotreitol is added to a final concentration of 2 M for reduction of any methionine sutoude to 
methionine residues. After about 20 hr of reduction at about 39°C the protein is repunfied by C 4 HPLC. The 
product is dried and cleaved on the carboxyl terminal side of methionine residues by 200 ul of 40 mM cyanogen 
bromide in about 70 % (v/v) formic acid under an argon atmosphere at about 20°C for about 24 hr in the dark. 
The cleavage products are resolved by C„ RP-HPLC. The amino acid sequence is shown in Figure 1 , see Conn 
et al.. Proc. Natl. Acad. Sci USA 87:2628-2632 (1990). 

The full length 1 90 amino acid residue protein translation product of the VEGF AB, A monomer or subunit. 
which is now known to be identical with the VEGF AA. A monomer, and its cDNA coding sequence are shown 
in Figures 2 and 6. The mature amino terminus begins at residue 27. immediately following a typical hydro phobic 
secretory leader sequence. A single potential N-glycosylation site exists at Asn 100 . Most (143 amino and resi- 
dues) of the 164 residues of the reduced and carboxymethylated mature subunit including the amino terminus 
and HPLC reversed phase-purrfied products of tryptic (T), Lys-C (L). Staphylococcus aureus V8 protease (V8) 
and cyanogen bromide (CB) cleavages, were determined by direct microsequencing (Applied Biosystems 
470A) using a total of 5 ug of protein. All residues identified by amino acid sequencing are denoted by arrows 
pointing to the right either directly beneath the mature processed sequence following the bracket at residue 27 
for the amino terminal determination of the whole subunit or. for residues identified from the polypeptide cleav- 
age products, above the double-headed arrows spanning the length of the particular polypeptide. One Mod 
pair of polypeptides. V18A and V1 8B, was sequenced as a mixture and. therefore, are only confirmatory of the 
cDNA-deduced amino acid sequence, see Figures 1 and 5. 

Samples of the reduced and carboxymethylated pure VEGF AB, A and B monomers, were each digested 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
TTTGTCGACA CTGCACTGTG TGCCGGTG 2 8 

10 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

is (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
TTTGTCGACA ACATTGGCCG TCTCCACC 2 8 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
TTTGTCGACA ATCGCCGCAG CAGCCGGT 28 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
- (A) LENGTH: 27 base pairs 
(B) TYPE: nucleic acid 
< C ) STRANDEDNESS : s ingle 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
TTTGTCGACT GGCTCTGGAC GTCTGAG 



(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
TTTGTCGACA CTGAAGAGTG TGACGG 26 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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10 



25 



30 



35 



(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28 

CAG AGA CCC ACA GAC TGC CAC CTG TGC GGC GAT 
GCT GTT 39 



CCC CGG AGG TAA 



51 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acids 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Gin Arg Pro Thr Asp Cys His Leu Cys Gly Asp 

zo Ala Val 

5 10 

Pro Arg Arg 
15 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 577 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

AACC 4 

ATG AAC TTT CTG CTC TCT TGG GTG CAC TGG ACC CTG GCT TTA CTG 4 9 
Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu 
5 10 15 



CTG TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC ACG ACA 94 

Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro Thr Thr 
40 20 25 30 

GAA GGG GAG CAG AAA GCC CAT GAA GTG GTG AAG TTC ATG GAC GTC 139 

Glu Gly Glu Gin Lys Ala His Glu Val Val Lys Phe Met Asp Val 
35 40 40 

45 TAC CAG CGC AGC TAT TGC CGT CCG ATT GAG ACC CTG GTG GAC ATC 164 

Tyr Gin Arg Ser Tyr Cys Arg Pro lie Glu Thr Leu Val Asp lie 
50 55 60 

TTC CAG GAG TAC CCC GAT GAG ATA GAG TAT ATC TTC AAG CCG TCC 229 

50 Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr He Phe Lys Pro Ser 

65 70 75 
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Cvl S? f CC ATG CGG TGT GCG GGC TGC TGC AAT GAT GAA GCC 274 
Cys val Pro Leu Met Arg Cy 3 Ala Gly Cys Cys Asn Asp Glu Ala 
80 85 go 

2u r GC » CC ACG TCG 6,15 AGC GTC ACT AT = CAG ATC 319 

Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gin u e 

95 100 los 

mI? »™ f CT ^ AGC 01(3 CAC AT * GGA GAG ATG AGC 364 

Met Arg lie Lys Pro His Gin Ser Gin His He Gly Glu Met Ser 

110 115 120 

TTC CTG CAG CAT AGC AGA TGT GAA TGC AGA CCA AAG AAA GAT AGA 405 
Phe Gin His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Ara 
125 130 13 ' 5 

ACA AAG CCA GAA AAT CAC TGT GAG CCT TGT TCA GAG CGG AGA AAG 4 54 
Thr Lys Pro Glu Asn His Cys Glu Pro Cys Ser Glu Arg Arg Lys 
140 145 iso 

His 25 Phi ^ GAT CCG * CG TGT *** TGT TCC T <* AAA 4 99 

His Leu Phe Val Gin Asp Pro Gin Thr Cys Lys Cys Ser Cys Lys 

155 160 165 

AAC ACA GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG TTA AAC GAA 54 4 
Asn ,hr Asp Ser Arg Cys Lys Ala Arg Gin Leu Glu Leu Asr. Glu 
170 175 180 

CGT ACT TGC AGA TGT GAC AAG CCA AGG CGG TGA * 77 
Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg 
185 190 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acids 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu 
5 10 15 

Leu- Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro Thr Thr 
20 25 30 

Glu Gly Glu Gin Lys Ala His Glu Val Val Lys Phe Met Asp Val 
35 40 45 

Tyr Gin Arg Ser Tyr Cys Arg Pro He Glu Thr Leu Val Asp lie 
50 55 6 o 

Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr He Phe Lys Pro Ser 
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65 70 75 

Cys Val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala 
5 80 85 90 

Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gin He 
95 100 105 

Met Arg He Lys Pro His Gin Ser Gin His He Gly Glu Met Ser 
10 no 115 120 

Phe Leu Gin His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg 
125 130 135 



15 



20 



25 



30 



35 



40 



45 



50 



Thr Lys Pro Glu Asn His Cys Glu Pro Cys Ser Glu Arg Arg Lys 
140 145 150 

His Leu Phe Val Gin Asp Pro Gin Thr Cys Lys Cys Ser Cys Lys 
155 160 165 

Asn Thr Asp Ser Arg Cys Lys Ala Arg Gin Leu Glu Leu Asn Glu 
170 175 180 

Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg 
185 190 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

ACCA 4 

ATG AAC TTT CTG CTC TCT TGG GTG CAC TGG ACC CTG GCT TTA CTG 4 9 
Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu 
5 10 15 

CTG TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC ACG ACA 94 
Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro Thr Thr 
20 25 30 

GAA GGG GAG CAG AAA GCC CAT GAA GTG GTG AAG TTC ATG GAC GTC 139 
Glu Gly Glu Gin Lys Ala His Glu Val Val Lys Phe Met Asp Val 
35 40 45 

TAC CAG CGC AGC TAT TGC CGT CCG ATT GAG ACC CTG GTG GAC ATC 184 
Tyr Gin Arg Ser Tyr Cys Arg Pro lie Glu Thr Leu Val Asp He 
50 55 60 

TTC CAG GAG TAC CCC GAT GAG ATA GAG TAT ATC TTC AAG CCG TCC 229 
Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr He Phe Lys Pro Ser 
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65 70 75 

TGT GTG CCC CTA ATG CGG TGT GCG GGC TGC TGC AAT GAT GAA GCC 274 
s Cys val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala 

80 85 go 

CTG GAG TGC GTG CCC ACG TCG GAG AGC AAC GTC ACT ATG CAG ATC 3' 9 
Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gin lie 
10 95 100 105 

ATG CGG ATC AAA CCT CAC CAA AGC CAG CAC ATA GGA GAG ATG AGC 364 
Met Arg He Lys Pro His Gin Ser Gin His lie Gly Glu Met Ser 

110 U5 120 

TTC CTG CAG CAT AGC AGA TGT GAA TGC AGA CCA AAG AAA GAT AGA 4 09 
Phe Leu Gin His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg 
125 130 13 5 

ACA AAG CCA GAA AAA TGT GAC AAG CCA AGG CGG TGA 445 
Thr Lys Pro Glu Lys Cys Asp Lys Pro Arg Arg 
140 145 



15 



20 



25 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu 

5 10 15 

Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro Thr Thr 
20 25 30 

Glu Gly Glu Gin Lys Ala His Glu Val Val Lys Phe Met Asp Val 
35 4 o 45 

Tyr Gin Arg Ser Tyr Cys Arg Pro He Glu Thr Leu Val Asp He 
50 55 60 

Phe Gin Glu Tyr Pro Asp Glu lie Glu Tyr He Phe Lys Pro Ser 
65 70 75 

Cys Val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala 
*0 85 90 

Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gin He 
95 100 105 

Met Arg He Lys Pro His Gin Ser Gin His He Gly Glu Met Ser 
so 110 115 120 



40 



45 
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Phe Leu Gin His Ser Arg Cys Glu Cys Arg Pro Lys Ly, Asp Arg 
125 130 

Thr Lys Pro Glu Lys Cys Asp Lys Pro Arg Arg 
140 145 

(2) INFORMATION FOR SEQ ID NO:34 ;_ T ^ e 
1 ' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 649 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 
ID NO: 34: 

4 

AACC 

iTr AAC TTT CTG CTC TCT TGG GTG CAC TGG ACC CTG GCT TTA CTG 4 9 
S £n III Leu S Ser Trp Val His Trp Tnr Leu Ala Leu Leu^ 



5 



S S 25 35 Sj S Z 5 S Sj S 2 £ £ E 91 

ss a s = s s = s s a s = = s s 18< 

5Q, 55 

TTC CAG GAG TAC CCC GAT GAG ATA GAG TAT ATC TTC AAG CCG TCC 229 
Phe Sn Su Tyr Pro Asp Glu lie Glu Tyr He Phe Lys Pro Ser 

65 70 
~, rrr <-TA ATG CGG TGT GCG GGC TGC TGC AAT GAT GAA GCC 274 

Z vl! Pro S Met S Cys Ala Gly Cya Cy, A,n A 3 p Glu Ala 



80 



CTG GAG TGC GTG CCC ACG TCG GAG AGC AAC GTC ACT ATG CAG ATC 319 
Su Su Cys Sal Pro Thr Ser Glu Ser Asn Val Thr Met Gin XI. 

95 100 

110 115 

TTC CTG CAG CAT AGC AGA TGT GAA TGC AGA CCA AAG AAA GAT AGA 4 09 
III 2u Sn Ser Arg Cy* Glu Cy, Arg Pro Ly, Lys Asp Arg 

125 130 
ACA AAG CCA GAA AAA AAA TCA GTT CGA GGA AAG GGA AAG GGT CAA 454 
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Thr ly 3 Pro Glu , Ly9 ser yal Arg QXy l ^ ^ y wn 

145 150 
AAA CGA AAG CGC AAG AAA TCC CGG TTT AAA TCC TGT 

Lys Arg Lys Arg L? Ly3 ser Atg p J £ «c agc err eje «99 

155 165 

SI SS 2 SI 2 2 S - - « 2 S 52 Si 2 5 " 



170 175 

2 2 s si s si is s; s 2 £ - » - ~ ». 

2 2 2 S! 2 2 2 2 2 2 £ « ~ - « - 

° 205 210 

AAG CCA AGG CGG TGA 

Lys Pro Arg Arg 649 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS' 

(A) LENGTH: 214 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO- 35- 



Met Asn Phe Leu Leu Ser Trp Val Hi 3 Trp Thr Leu Ala Leu Leu 
5 10 15 

Leu Tyr Leu Hi, His Ala Lys Trp Ser Gin Ala Ala Pro Thr Thr 

20 25 



30 



Glu Gly Glu Gin Lys Ala His Glu Val Val Lys Phe Met Asp Val 

35 «0 45 

Tyr Gin Arg Ser Tyr C y3 Arg Pro lie Glu Thr Leu Val Asp He 

50 55 60 

Phe Gin Glu Tyr Pro Asp Glu lie Glu Tyr He Phe Lys Pro Ser 

" ™ 75 

Cys Val Pro Leu Met Arg Cy, Ala Gly Cys Cys Asn Asp Glu Ala 

80 « 90 

Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gin He 

95 mn 



Met Arg He Lys Pro Hi 3 Gin Ser Gin Hi* He Gly Glu Met Ser 
110 115 



100 105 

Ser 
120 
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Phe Leu Gin His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg 

130 135 



125 



Thr Lys Pro Glu Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gin 
140 145 ^ 

Lys Arg Lys Arg Lys Lys Ser Arg Phe Lys Ser Trp Ser Val His 
155 160 165 

Cys Glu Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gin Asp 
170 175 18C 

Pro Gin Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys 
185 lBb 

Lys Ala Arg Gin Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp 
200 205 210 

Lys Pro Arg Arg 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

ATG CTG GCC ATG AAG CTG TTC ACT TGC TTC TTG CAG GTC CTA GCT A 5 
Met Leu Ala Met Lys Leu Phe Thr Cys Phe Leu Gin Val Leu Ala 

10 15 



5 



GGG TTG GCT GTG CAC TCC CAG GGG GCC CTG TCT GCT GGG AAC AAC 90 
Gly Leu Ala Val His Ser Gin Gly Ala Leu Ser Ala Gly Asn Asn 
20 25 30 

TCA ACA GAA ATG GAA GTG GTG CCT TTC AAT GAA GTG TGG GGC CGC 135 
Ser Thr Glu Met Glu Val Val Pro Phe Asn Glu Val Trp Gly Arg 
35 40 45 

AGC TAC TGC CGG CCA ATG GAG AAG CTG GTG TAC ATT GCA GAT GAA 180 
Ser Tyr Cys Arg Pro Met Glu Lys Leu Val Tyr He Ala Asp Glu 



50 



CAC CCT AAT GAA GTG TCT CAT ATA TTC AGT CCG TCA TGT GTC CTT 22 5 
His Pro Asn Glu Val Ser His He Phe Ser Pro Ser Cys Val Leu 
65 70 75 

CTG AGT CGC TGT AGT GGC TGC TGT GGT GAC GAG GGT CTG CAC TGT 27 0 
Leu Ser Arg Cys Ser Gly Cys Cys Gly Asp Glu Gly Leu His Cys 
80 85 90 
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10 



15 



40 



45 



S? 2 2 £ s s s s S K K s Si s S '» 

95 100 105 

CCC CCC AAT CGG GAT CCA CAT rrr ~. 

.» «. J, s si E £ «• «» « « « in >« 

115 

SJ 5 S 2 £ S SS 5 5 ill 2 £ £ ™ 

130 135 

GCA GAA AGG TAA 

Ala Glu Arg 417 

(2) INFORMATION FOR SEQ ID NO:37- 

(i) SEQUENCE CHARACTERISTICS • 

!i! SSJ TH: 138 amino acid s 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
<xa) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Leu Ala Met Lys Leu Phe Thr Cys phe ^ ^ ^ ^ 

5 io 15 

Gly Leu Ala v al His Ser Gin Gly Ala Leu Ser Ala Gly Asn Asn 

25 30 

Ser Thr Glu Met Glu Val Val dk^ * 

30 35 al Val Pro Ph€ A « Glu Val Trp Gly Arg 



20 



Ser ryr Cys Arg P „ Met Glu Lys Le u V al Tyr lie Ala Asp Glu 

55 60 

His Pro Asn Glu Val Ser ok. ^ 

35 65 r ° Ser C * 3 Val L eu 

70 75 

Leu Ser Arg Cy 3 Ser Gly Cy S Cy s Gl y A9p Glu Gly Leu His Cys 

85 90 

V.l Al. Leu Ly S Thr Ala Asn lie Thr Met Gin He Leu Lys He 

" 10 ° 105 

Pro Pro Asn Arg Asp Pro His Ser Tyr Val Glu Met Thr Phe Ser 

115 120 

Gin Asp Val Leu Cys Gl u C ys Arg Pro lie Leu Glu Thr Thr Lys 

130 135 
Ala Glu Arg 

so (2) INFORMATION FOR SEQ ID NO: 38' 

(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

ATG CTG GCC ATG AAG CTG TTC ACT TGC TTC TTG CAG GTC CTA GCT 4 5 
Met Leu Ala Met Ly 3 Leu Phe Thr Cys Phe Leu Gin Val Leu Ala 
5 10 13 

GGG TTG GCT GTG CAC TCC CAG GGG GCC CTG TCT GCT GGG AAC AAC 9C 
Gly Leu Ala Val His Ser Gin Gly Ala Leu Ser Ala Gly Asn Asn 
20 2 5 30 

TCA ACA GAA ATG GAA GTG GTG CCT TTC AAT GAA GTG TGG GGC CGC 135 
Ser Thr Glu Met Glu Val Val Pro Phe Asn Glu Val Trp Gly Arg 
35 40 * 3 

AGC TAG TGC CGG CCA ATG GAG AAG CTG GTG TAC ATT GCA GAT GAA 13C 
Ser Tyr Cys Arg Pro Met Glu Lys Leu Val Tyr lie Ala Asp Glu 
50 55 

CAC CCT AAT GAA GTG TCT CAT ATA TTC AGT CCG TCA TGT GTC CTT 22 5 
His Pro Asn Glu Val Ser His lie Phe Ser Pro Ser Cys Val Leu 
65 7 ° 75 

CTG AGT CGC TGT AGT GGC TGC TGT GGT GAC GAG GGT CTG CAC TGT 27 0 
2u Ser Arg Cys Ser Gly Cys Cys Gly Asp Glu Gly Leu His Cys 
80 85 

GTG GCG CTA AAG ACA GCC AAC ATC ACT ATG CAG ATC TTA AAG ATT 315 
val Ala Leu Lys Thr Ala Asn lie Thr Met Gin lie Leu Lys lie 

95 100 iu= 



CCC CCC AAT CGG GAT CCA CAT TCC TAC GTG GAG ATG ACA TTC TCT 3S0 
Pro Pro Asn Arg Asp Pro His Ser Tyr Val Glu Met Thr Phe Ser 

no I 15 

CAG GAT GTA CTC TGC GAA TGC AGG CCT ATT CTG GAG ACG ACA AAG 4 05 
Gin Asp Val Leu Cys Glu Cys Arg Pro lie Leu Glu Thr Thr Lys 

125 130 AJ3 

GCA GAA AGG AGG AAA ACC AAG GGG AAG AGG AAG CAA AGC AAA ACC 450 
Ala Glu Arg Arg Lys Thr Lys Gly Lys Arg Lys Gin Ser Lys Thr 

140 

All 

CCA CAG ACT GAG GAA CCC CAC CTG TGA 
Pro Gin Thr Glu Glu Pro His Leu 
155 

(2) INFORMATION FOR SEQ ID NO: 39: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO • 39 



Met Leu Ala Met Lys Leu Phe Thr Cys Phe Leu Gin Val Leu Ala 
5 10 15 

Gly Leu Ala Val His Ser Gin Gly Ala Leu Ser Ala Gly Asr. Asn 
20 25 30 

Ser Thr Glu Met Glu Val Val Pre Phe Asn Glu Val Trp Gly Arg 
35 40 45 

Ser Tyr Cys Arg Pro Met Glu Lys Leu Val Tyr He Ala Asp Glu 
50 55 60 

His Pro Asn Glu Val Ser His lie Phe Ser Pro Ser Cys val Leu 
65 70 75 

Leu Ser Arg Cys Ser Gly Cys Cys Gly Asp Glu Gly Leu His Cys 
80 85 90 

Val Ala Leu Lys Thr Ala Asn lie Thr Met Gin He Leu Lys He 

95 100 los 

Pro Pro Asn Arg Asp Pro His Ser Tyr Val Glu Met Thr Phe Ser 
HO lis 120 

Gin Asp val Leu Cys Glu Cys Arg Pro He Leu Glu Thr Thr Lys 
125 130 i3 5 

Ala Glu Arg Arg Lys Thr Lys Gly Lys Arg Lys Gin Ser Lys Thr 
140 145 150 

Pro Gin Thr Glu Glu Pro His Leu 
155 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

ATG CCG GTC ATG AGG CTG TTC CCT TGC TTC CTG CAG CTC CTG GCC 
Met Pro Val Met Arg Leu Phe Pro Cys Phe Leu Gin Leu Leu Ala 
5 10 IS 



37 



EP 0 506 477 A1 



GGG CTG 
Gly Leu 

GCT GGG 
Ala Gly 

GTG TGG 
Val Trp 



GTC GTG 
Val Val 



TCC TGT 
Ser Cys 



AAT CTG 
Asn Leu 



CTC CTA 
Leu Leu 



ACG TTC 
Thr Phe 



GCG CTG CCT GCT GTG CCC CCC CAG CAG TGG 
Ala Leu Pro Ala Val Pro Pro Gin Gin Trp 
20 25 

AAC GGC TCG TCA GAG GTG GAA GTG GTA CCC 
Asn Gly Ser Ser Glu Val Glu Val Val Pro 
35 40 

GGC CGC AGC TAG TGC CGG GCG CTG GAG AGG 
Gly Arg Ser Tyr Cys Arg Ala Leu Glu Arg 
50 55 

TCC GAG TAC CCC AGC GAG GTG GAG CAC ATG 
Ser Glu Tyr Pro Ser Glu Val Glu His Met 
€5 70 

GTC TCC CTG CTG CGC TGC ACC GGC TGC TGC 
Val Ser Leu Leu Arg Cys Thr Gly Cys Cys 
80 85 

CAC TGT GTG CCG GTG GAG ACG GCC AAT GTC 
His Cys Val Pro Val Glu Thr Ala Asn Val 
95 100 

AAG ATC CGT TCT GGG GAC CGG CCC TCC TAC 
Lys lie Arg Ser Gly Asp Arg Pro Ser Tyr 
110 us 

TCT CAG CAC GTT CGC TGC GAA TGC CGG CCT 
Ser Gin His Val Arg Cys Glu Cys Arg Pro 
125 130 



GCC TTG 
Ala Leu 



TTC CAG 
Phe Gin 



CTG GTG 
Leu Val 



TTC AGC 
Phe Ser 



GGC GAT 
Gly Asp 



ACC ATG 
Thr Met 



GTG GAG 
Val Glu 



TCT 90 
Ser 
30 

GAA 135 
Glu 
45 

GAC 180 
Asp 
60 

CCA 225 
Pro 
75 

GAG 270 
Glu 
90 

CAG 315 

Gin 

105 

CTG 360 
Leu 
120 



AAG ATG 
Lys Met 

AGG AGA 
Arg Arg 



AAG CCG 
Lys Pro 



GAG AAG 
Glu Lys 



GAA AGG AGG AGA CCC AAG GGC AGG 
Glu Arg Arg Arg Pro Lys Gly Arg 
140 145 

TAG 



CTG CGG 
Leu Arg 



GGG AAG 
Gly Lys 



GAG 4 05 

Glu 

135 

AGG 4 50 

Arg 

150 

465 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Met Pro Val Met Arg Leu Phe Pro Cys Phe Leu Gin Leu Leu Ala 
5 10 15 

Gly Leu Ala Leu Pro Ala Val Pro Pro Gin Gin Trp Ala Leu Ser 
20 25 30 
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Ala Gly Asn Gly Ser Ser Glu Val Glu Val Val Pro Phe Gin Glu 
35 40 45 

Val Trp Gly Arg Ser Tyr Cys Arg Ala Leu Glu Arg Leu Val Asp 

50 55 60 

Val Val Ser Glu Tyr Pro Ser Glu Val Glu His Met Phe Ser Pro 
65 70 75 

Ser Cys Val Ser Leu Leu Arg Cys Thr Gly Cys Cys Gly Asp Glu 
80 85 90 

Asn Leu His Cys Val Pro Val Glu Thr Ala Asn Val Thr Met Gin 
95 100 105 

Leu Leu Lys lie Arg Ser Gly Asp Arg Pro Ser Tyr Val Glu Leu 
HO 115 120 

Thr Phe Ser Gin His Val Arg Cys Glu Cys Arg Pro Leu Arg Glu 
125 130 us 

Lys Met Lys Pro Glu Arg Arg Arg Pro Lys Gly Arg Gly Lys Arg 
140 145 150 

Arg Arg Glu Lys 



Claims 
1 



A purified and isolated DNA sequence encoding the C subunit of vascular endothelial cell growth factor. 
2. A purified and isolated vascular endothelial cell growth factor C subunit DNA sequence comprising: 

ATG OCG GTC ATG AGG CTG TTC CCT TGC TTC CTG CAG CTC CTG 
GCC GGG CTG GOG CTG CCT GCT GTG CCC 00C CAG CAG TGG GCC 
TTG TCT GCT GGG AAC GGC TCG TCA GAG GTG GAA GTG GTA CCC 
TTC CAG GAA GTG TGG GGC CGC AGC TAC TGC CGG GCG CTG GAG 
AGG CTG GTG GAC GTC GTG TCC GAG TAG CCC AGC GAG GTG GAG 
CAG ATG TTC AGC CCA TCC TGT GTC TOC CTG CTG CGC TGC ACC 
GGC TGC TGC GGC GAT GAG AAT CTG CAC TGT GTG COG GTG GAG 
ACG GCC AAT GTC ACC ATG CAG CTC CTA AAG ATC CGT TCT GGG 
GAC CGG CCC TCC TAC GTG GAG CTG AGG TTC TCT CAG CAC GTT 
CGC TGC GAA TGC CGG CCT CTG CGG GAG AAG ATG AAG OCG GAA 
AGG AGG AGA CCC AAG GGC AGG GGG AAG AGG AGG AGA GAG AAG 
TAG. SEQ ID NO: 40 

3 - "z^tr ,mm -* ac dna an a su6 - ■* - • <= 
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5. A purified and isolated vascular endothelial cell growth factor C subunit DNA sequence comprising: 
ATG CCG GTC ATG AGG CTG TTC CCT TGC TTC CTG CAG CTC CTG 

gcc gog ctg gog ctg cct gct gig ccc ccc cag cag tgg gcc 
ttg tct gct gog aac ggc tog tca gag gtg gaa gtg gta ccc 
ttc cag gaa gtg tgg ggc ogc aqc tac tgc cgg gcg ctg gag 
agg ctg gtg gac gtc gtg tcc gag tac ccc agc gag gtg gag 
cacatgttcagcccato:tgtgtctccctgct^ 

GGC TGC TGC GGC GAT GAG AAT CTG CAC TGT GTG CCG GTG GAG 
ACG GCC AAT GTC ACC ATG CAG CTC CTA AAG ATC CGT TCT GGG 
GAC CGG CCC TCC TAC GTG GAG CTG ACG TTC TCT CAG CAC GTT 
CGC TGC GAA TGC CGG CCT CTG CGG GAG AAG ATG AAG CCG GAA 
AGG AGG AGA CCC AAG GGC AGG GGG AAG AGG AGG AGA GAG AAG 
CAG AGA CCC ACA GAC TGC CAC CTG TGC GGC GAT GCT GTT CCC 
CGG AGG TAA. SEQ ID NOS:29 & 40 

6 Vascular endothelial cell growth factor AC DNA comprising an A subunit DNA sequence selected form 
the group consisting ok a DNA sequence encoding an 1 89 amino acid form, a DNA sequence ^encoding 
an 165 amino acid form and a DNA sequence encoding a 121 amino acid form, with said A subunit DNA 
operably attached to a C subunit DNA sequence. 

7 Vascular endothelial cell growth factor BC DNA comprising a 8 subunit DNA sequence selected form the 
qroup consisting of a DNA sequence encoding a 135 amino acid form; and a DNA sequence encoding a 
115 amino acid form, with said B subunit DNA sequence operably attached to a C subunit DNA sequence. 

8. Homodimeric vascular endothelial growth factor DNA comprising C subunit DNA sequences. 

9. A vector containing the DNA sequence of any one of claims 3 to 8. 

10. A host cell transformed by the vector of claim 9 containing the DNA sequence encoding vascular endothe- 
lial cell growth factor. 

11 A process for the preparation of vascular endothelial cell growth factor comprising culturing the transfor- 
' med host cell of claim 10 under conditions suitable for the expression of vascular endothelial cell growth 
factor and recovering vascular endothelial cell growth factor. 

12. Vascular endothelial growth factor made by the process of claim 1 1 . 

13. Vascular endothelial cell growth factor AC comprising an A subunit amino acid sequence and a C subunit 
amino acid sequence. 

■ 14. Vascular endothelial cell growth factor BC comprising a B subunit amino acid sequence and a C subunit 
amino acid sequence. 

15. Vascular endothelial cell growth factor CC comprising a C subunit amino acid sequence and a C subunit 
amino acid sequence. 

16. A purified and isolated vascular endothelial cell growth factor 0 subunit amino acid sequence comprising: 
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£ fro Val Met Arg Le u Phe Pro ^ Phe ^ Gln ^ 
Ala Gly Leu Ma i* u Pr0 ^ Val pro pro ^ ^ 

leu Ser Ala Gly ten Gly Ser Ser Glu Val Glu Val Val P , 

Z S" ^ TIP G1V *** * r ^ <*» ** Ala Leu Q 

S 2? ^ ^ au ^ Pr ° slu ^1 Glu 

His Met Phe Ser Pro Ser Cys Val Ser Leu leu Arg Cys Tor 

Gly Cys Cys Gly Asp Glu Asn Leu His Cys Val S Val Glu 
Thr Ala Asn Val Thr Met Gin Leu Leu Lys lie Arg Ser Gly 
Asp Arg Pro Ser Tyr Val Glu leu Thr Phe Ser gS His S 
Arg Cys Glu Cys Arg Pro Leu Arg Glu Lys Met Lys Pro Glu 
Zm^n ^ ^ ^ ^ ^ Arg Arg Arg Glu Lys. 

17. A m «.d and feolatad vaacular andomiial can arow* factor c aobnni, amino add seouenc eompnaing: 

Ala Gly leu Ala Leu Pro Ala Val Pro Pro Gin Gin Trp Ala 
£ Ser Ala Gly Asn Gly Ser Ser Glu Val Glu Val VaT £o 
Phe Gin Glu Val Trp Gly Ar, Ser Tyr Cys Arg Ala Leu g'o 

^zt^izz^*^ 

Thr Ala Asn Val Thr tfet Gin Leu Leu Lys lie Arg Ser G'v 

S C? IT T ^ G1U ^ ^ ^ Ss £ 
Arg Cys Glu Cys Arg Pro Leu Arg Glu Lys Met Lys Pro G'u 
Arg Arg Arg Pro Lys Gly ^ Gly ^ ^ 

Gin Arg Pro mr Asp cys His Leu Cys Gly Asp All Val Pro 
Arg Arg. SEQ ID HQS: 29 s 40 r ° 

raping am™,™ or «» wtrfi , a vaacula, ,ndoth.l(al grc«h factor of an, on. of data 13 »7s 

w ' «.z^:r^on^ 
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ATG AAC TTT CTG CTC TCT TOG GTG CAC TGG ACC CTG GCT TTA CTG 49 
Mat Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu 
5 10 15 

CTG TAC CTC CAC CAT GCC AAG TOG TOC CAG GCT OCA CCC AOG ACA 94 
Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro Thr Thr 
20 25 30 

GAA GGG GAG CAG AAA GCC CAT GAA GTG GTG AAG TTC ATG GAC GTC 139 
Glu Gly Glu Gin Lys Ala His Glu Val Val Lys Phe Met Asp Val 
35 40 45 

TAC CAG CGC AGC TAT TGC CGT CCG ATT GAG ACC CTG GTG GAC ATC 184 
Tyr Gin Arg Ser Tyr Cys Arg Pro He Glu Thr Leu Val Asp He 
50 55 S) 

TTC CAG GAG TAC CCC GAT GAG ATA GAG TAT ATC TTC AAG CCG TCC 229 
Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr He Phe Lys Pro Ser 
65 70 "75 

TGT GTG CCC CTA ATG CGG TGT GCG GGC TGC TGC AAT GAT GAA GCC 274 
Cys Val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala 
80 85 90 

CTG GAG TGC GTG CCC ACG TOG GAG AGC AAC GTC ACT ATG CAG ATC 319 
Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gin He 
95 100 105 

ATG CGG ATC AAA CCT CAC CAA AGC CAG CAC ATA GGA GAG ATG AGC 364 
Met Arg He Lys Pro His Gin Ser Gin His He Gly Glu Met Ser 
110 115 120 

TTC CTG CAG CAT AGC AGA TGT GAA TGC AGA CCA AAG AAA GAT AGA 409 
Phe Leu Gin His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg 
125 130 135 



ACA AAG CCA GAA AAA TGT GAC AAG CCA AGG CGG TGA 
Thr Lys Pro Glu Lys Cys Asp Lys Pro Arg Arg 
140 145 



445 



FIG. 4 
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Met Asn Phe la, Leu Ser Trp Val His Trp Thr ™ S S S 
5 10 15 

^ HlS Ma ^ Trp Ser Gin Ala Ala Pro Thr Thr 
20 25 a) 

35 40 45 

65 7 0 75 

i?? AT ^ TGT GCG QQC TGC TQC AAT GAT GAA rrr 

S Val Pro ^ Arg Cy S Ala Gly Cys Cys^ Asn Asp Glu Ma 
80 85 go 

CTG GAG TGC GTG COC AOG TCG GAG AQC AAC GTC ACT ATT far air' tia 
*u Glu C^s Va! Pro Tfc r ser Glu Ser S £ Gin £ 

95 100 105 

K££222£222£2££" 

110 H5 120 

125 130 * 135 

?* 2* MT ^ ^ ^ OT TOT TCA GAG CGG AGA AAG 4 54 
Thr Lys Pro Glu Asn His Cys Glu Pro Cys Ser Glu Arg Arg Lys 

145 150 

155 160 165 

A^T^S? J? ^ 002 OG CTT GAG TTA AAC GAA 544 
Asn Thr Asp Ser Arg Cys Lys Ala Arg Gin Leu Glu Leu Asn Glu 

170 175 180 



CGTACTTGCAGATGTGACAAGCCAAGGCGGTGA 
Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg 
185 190 



577 



FIG. 5 
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AAOC ATG AAC ITT CTG CTC TCT TOG GTG CAC TOG ADC CTG GCT TTA CTG 49 
Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu 
5 10 15 

CTG TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC ACG ACA 94 
Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro Thr Thr 
20 25 30 

GAA GGG GAG CAG AAA GCC CAT GAA GTG GTG AAG TTC ATG GAC GTC 139 
Glu Gly Glu Gin Lys Ala His Glu Val Val Lys Phe Met Asp Val 
35 40 45 

TAC CAG CGC AGC TAT TGC CGT CCG ATT GAG ACC CTG GTG GAC ATC 184 
Tyr Gin Arg Ser Tyr Cys Arg Pro He Glu Thr Leu Val Asp He 
50 55 © 

TTC CAG GAG TAC COC GAT GAG ATA GAG TAT ATC TTC AAG CCG TCC 229 
Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr He Phe Lys Pro Ser 
65 "70 "75 

TGT GTG CCC CTA ATG CGG TGT GCG GGC TGC TGC AAT GAT GAA GCC 274 
Cys Val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala 
80 85 90 

CTG GAG TGC GTG CCC ACG TOG GAG AGC AAC GTC ACT ATG CAG ATC 319 
Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gin He 
95 100 105 

ATG CGG ATC AAA OCT CAC CAA AGC CAG CAC ATA GGA GAG ATG AGC 364 
Mst Arg He Lys Pro His Gin Ser Gin His He Gly Glu Met Ser 
110 115 120 

TTC CTG CAG CAT AGC AGA TGT GAA TGC AGA CCA AAG AAA GAT AGA 409 
Phe Leu Gin His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg 
125 130 135 

ACA AAG CCA GAA AAA AAA TCA GTT CGA GGA AAG GGA AAG GGT CAA 454 
Thr Lys Pro Glu Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gin 
140 145 150 

AAA CGA AAG CGC AAG AAA TCC CGG TTT AAA TCC TGG AGC GTT CAC 499 
Lys Arg Lys Arg Lys Lys Ser Arg Phe Lys Ser Trp Ser Val His 
155 160 165 



FIG. 6A 
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£^£^ 2? ^ TO TO ™ OA GAT 544 

Cys Glu Pro Cys iter Glu Arg Arg Lys His leu Phe Val GLri Asp 

170 175 1B f 

200 . 205 21 f 

AAG CCA AGG CGG TGA 
Lys Pro Arg Arg 



649 



FI6.6B 
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ATG CTG GOC ATG AAG CTG TTC ACT TGC TTC TTG CAG GTC CTA GCT 
Met Leu Ala Met Lys Leu Phe Thr Cys Phe Leu Gin Val Leu Ala 
5 10 15 



GGG TTG GCT GIG CAC TCC CAG GOG GOC CTG TCT OCT QQG AAC AAC 
Gly Leu Ala Val His Ser Gin Gly Ala Leu Ser Ala Gly Asn Asn 
20 25 3) 



45 



90 



TCA ACA GAA ATG GAA GTG GTG CCT TTC AAT GAA GTG TGG QQC CGC 
Ser Thr Glu Met Glu Val Val Pro Phe Asn Glu Val Trp Gly Arg 
35 40 45 

AGC TAC TGC CGG CCA ATG GAG AAG CTG GTG TAC ATT GCA GAT GAA 
Ser Tyr Cys Arg Pro Met Glu Lys Leu Val Tyr He Ala Asp Glu 
50 55 © 

CAC CCT AAT GAA GTG TCT CAT ATA TTC ACT CCG TCA TGT GTC CTT 
His Pro Asn Glu Val Ser His He Phe Ser Pro Ser Cys Val Leu 
65 70 75 

CTG AGT CGC TGT AGT GGC TGC TGT GGT GAC GAG GGT CTG CAC TGT 
Leu Ser Arg Cys Ser Gly Cys Cys Gly Asp Glu Gly Leu His Cys 
80 85 a 

GTG GOG CTA AAG ACA GCC AAC ATC ACT ATG CAG ATC TTA AAG ATT 
Val Ala Leu Lys Thr Ala Asn lie Thr Met Gin lie Leu Lys lie 
95 100 105 

CCC CCC AAT CGG GAT CCA CAT TCC TAC GTG GAG ATG ACA TTC TCT 
Pro Pro Asn Arg Asp Pro His Ser Tyr Val Glu Met Thr Phe Ser 
110 H5 " u 

CAG GAT GTA CTC TGC GAA TGC AGG CCT ATT CTG GAG ACG ACA AAG 
Gin Asp Val Leu Cys Glu Cys Arg Pro lie Leu Glu Thr Thr Lys 
125 130 135 

GCA GAA AGG TAA 
Ala Glu Arg 



135 
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405 



417 



FIG.7 
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10 35 
* 40 ^ 

65 70 ^ 

ioo 105 

115 120 

130 135 

14^ ™r Lys Gly Lys Arg Lys Gin Ser Lys Thr 

145 150 

C^OGACTGAGGMcCCCACCrcrcft 
Pro Gin Thr Glu Glu Pro His Leu 

155 
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90 



135 
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QBG OS QGS CTG CH OCT GIG (XT « M M TO g TO g 
Gly Leu Ala Leu Pro Ala Val Pro Pro Gin Gin Trp Ala Leu ser 
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OCT GGG AAC GGC TOG TCA GAG GTG GAA GTG GTA CCC TTC CAG GAA 
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v£ S S S S ^ Cys Ar, Ala U. G1 U Ar, I. Val ft* 
50 55 

CTC GTG TOC GAG TAC CCC AGC GAG GTG GAG CAC ATG TTC AGC CCA 
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